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Abstract 

Background: HBV genotype F is primarily found in indigenous populations from South America and is classified in 
four subgenotypes (F1 to F4). Subgenotype F2a is the most common in Brazil among genotype F cases. The aim of 
this study was to characterize HBV genotype F2a circulating in 16 patients from Sao Paulo, Brazil. Samples were 
collected between 2006 and 2012 and sent to Hospital Israelita Albert Einstein. A fragment of 1306 bp partially 
comprising HBsAg and DNA polymerase coding regions was amplified and sequenced. Viral sequences were 
genotyped by phylogenetic analysis using reference sequences from GenBank (n=198), including 80 classified as 
subgenotype F2a. Bayesian Markov chain Monte Carlo simulation implemented in BEAST v.1.5.4 was applied to 
obtain the best possible estimates using the model of nucleotide substitutions GTR+G+I. 

Findings: It were identified three groups of sequences of subgenotype F2a: 1)10 sequences from Sao Paulo state; 
2) 3 sequences from Rio de Janeiro and one from Sao Paulo states; 3) 8 sequences from the West Amazon Basin. 

Conclusions: These results showing for the first time the distribution of F2a subgenotype in Brazil. The spreading 
and the dynamic of subgenotype F2a in Brazil requires the study of a higher number of samples from different 
regions as it is unfold in almost all Brazilian populations studied so far. We cannot infer with certainty the origin of 
these different groups due to the lack of available sequences. Nevertheless, our data suggest that the common 
origin of these groups probably occurred a long time ago. 
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Findings 

Despite the availability of an effective vaccine, more than 
350 million people worldwide are chronically infected 
with hepatitis B virus (HBV), and many people have de- 
veloped severe liver diseases, such as cirrhosis and hepa- 
tocellular carcinoma (HCC) [1]. HBV has a partially 
double-stranded DNA genome composed of approxi- 
mately 3.2 kb, with four overlapping open reading 
frames (ORF): preS/S, precore/core, X and polymerase) 
[2]. As a result of its unusual mechanism of replication 
by reverse transcription and the lack of proofreading ac- 
tivity of this enzyme, it displays a highly heterogeneous 
sequence diversity. 
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Nine HBV genotypes (HBV/A to HBV/I) and a contro- 
versial HBV/J have been reported based on the differences 
in full-length genome sequences [3]. Genotype A (Al to 
A7) is highly prevalent in sub-Saharan Africa (subgenotype 
Al), Northern Europe (subgenotype A2), and Western 
Africa (subgenotype A3). Furthermore, both A4 and po- 
tential A5 strains were found in sub-Saharan Africa 
(Mali, Gambia and Nigeria) [4,5]. Recently, A5 was also 
found in African descendants in Haiti [6]. Proposed 
subgenotype A6 include strains from African- Belgian 
patients originating from Congo and Rwanda [7]. 

HBV/B and HBV/C genotypes are common in Asia. 
Currently, genotype B is divided in subgenotypes Bl to 
B6. Among them, Bl is found mainly in Japan, B2-5 in 
East Asia, and B6 in indigenous populations living in the 
Arctic, such as Alaska, Northern Canada and Greenland 
[8]. HBV genotype C (CI to C5) is found throughout the 
eastern and southeastern portions of Asia and the Pacific 
islands (Micronesia, Melanesia, and Polynesia), as well 
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2% Divergence 

Figure 1 The Maximum Clade Credibility (MCC) tree was estimated by a Bayesian analysis of 198 S/POL sequences of hepatitis B virus 
strains. The posterior probabilities of the key nodes are shown above the respective nodes. The HBV/F2a samples obtained from Sao Paulo 
(n=19) were analyzed together with other strains from around the world. Brazilian sequences from North Region (green), Northwest Region 
(blue), Southeast Region (red) and Center West Region (purple) were highlighted in tree. The clusters containing the strains of other HBV 
genotypes collapsed. 



as in immigrants from these areas in the United States, 
Europe, Australia, New Zealand [9] and Brazil [10]. 
HBV/D with subtypes D1-D5 is prevalent in the Medi- 
terranean region expanding through South Asia up to 
India [8]. HBV/E is restricted to West Africa [11] but 
some studies reported the presence of this genotype in 
South America [12] and India [13]. This genotype is 
characterized by a low genetic diversity among strains 
from throughout the world [14]. Genotype G has been 
reported in USA and France [15]. HBV/F and HBV/H 
genotypes are found in the Americas, with genotype F 
described from Alaska to Argentina, while genotype H 
has been mainly described in Central America and 
Mexico [16]. 

HBV genotypes F and H are considered indigenous to 
the American continent [15,17,18]. An HBV strain iso- 
lated from woolly monkey (a New World monkey) is 



closer to genotype F but with a noteworthy phylogenic 
distance from other strains of HBV [19]. Several strains 
of genotype F have been isolated in different countries in 
America, especially among Amerindian population and 
four subgenotypes have been described with genetic di- 
vergence among 4.3-6.1% [20]. 

HBV genotypes and subgenotypes have distinct geo- 
graphical distribution and it is currently discussed if they 
are associated with different prognosis considering re- 
sponse to antiviral treatment and the severity of liver dis- 
ease in different populations. Nevertheless, global human 
migrations affect the pattern of HBV genotypes distribu- 
tion, introducing genotypes differing from those found in 
the original inhabitants. 

HBV genotypes A, D and F are the most frequent in Sao 
Paulo [21], but there are not any published data on 
subgenotypes found there. It was previously reports that 
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Figure 2 Geographic location of HBV sequences from Brazil: North Region (3 AP - Amapa State, 5 RO - Rondonia State, 26 AM - 
Amazonas State, including 5 from Manaus, the capital city, 5 from the Apurinas people and 16 WAB from Western Amazon Basin 
(Apurinas and WAB samples were obtained from riverside communities living near Purus River); Northwest Region (5 PE -Pernambuco 
State); Southeast Region (11 RJ - Rio de Janeiro State and 19 SP - Sao Paulo State) and Center West Region (2 MT - Mato 
Grosso State). 



subgenotype F2 is the most frequent in Brazil [22], particu- 
larly subgenotype F2a, as latter subclassified by Devesa 
et al [20]. In this study, we compare HBV subgenotype F2a 
sequences recently obtained in Sao Paulo from chronic 
hepatitis B patients with other sequences previously de- 
scribed around the world with the aim to determine the 
origin of subgenotype F2a circulating in Sao Paulo, Brazil. 



Nineteen samples from chronically infected patients 
(seventeen male and two female) with HBV subgenotype 
F2a were obtained between 2006 to 2012 at Albert Einstein 
Diagnostic Medicine. The study was approved by the re- 
search Ethics committee of Albert Einstein Hospital (CEP/ 
Einstein No 10/1293) and an informed consent for partici- 
pation in the study was obtained from each patient. 
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These samples were initially subgenotyped using a system 
developed in the Clinical Laboratory [23] and were then 
selected for a longer fragment (1306 bp) amplification 
using other previously described primers [18]. 

For viral DNA extraction from 200 \A serum, it was used 
a commercial available kit (QIAamp mini, QIAGEN, 
Germany), according to the manufacturers instructions. 
Precipitated DNA was resuspended in 200 \A of elution 
buffer and stored at -20°C until use. To avoid false-positive 
results, strict procedures proposed for nucleic acid 
amplification diagnostic techniques were followed [24]. The 
amplified DNA was purified using ChargeSwitch® PCR 
Clean-Up kit (Life Technologies, Sao Paulo, Brazil). 
Sequencing was performed in an ABI 3500 Genetic 
Analyzer (Life Technologies, Foster City, CA, USA) using 
dideoxynucleoside triphosphates (ddNTPs) containing 
fluorescent markers (Big Dye® Terminator v3.1 Cycle 
Sequencing Ready Reaction kit - Life Technologies, Foster 
City, CA, USA). The quality of each electropherogram was 
evaluated using the Phred-Phrap software [25] and consen- 
sus sequences were obtained by alignment of both se- 
quenced strands (sense and antisense) using CAP3 software 
available at the web page Eletropherogram quality ana- 
lysis Phred (http://www.biomol.unb.br/phph/). 

HBV sequences were aligned with other previously 
reported genome sequences (n = 198) using ClustalX soft- 
ware [26] and was edited with the SE-AL software (available 
from: tree.bio.ed.ac.uk/software/seal/). The Bayesian Mar- 
kov chain Monte Carlo simulation implemented in BEAST 
v. 1.4.8 [27] was applied to obtain the best possible estimates 
using both the relaxed uncorrelated log norma i and exponen- 
tial molecular clock models and using the model of nucleo- 
tide substitution (general time-reversible + gamma + 
proportion invariant). The molecular clock that best fits the 
data was chosen by Bayes factor comparison. After 20 mil- 
lion generations, the maximum clade credibility (MCC) 
tree was obtained by summarizing 20,000 substitution trees 
and was then modified using a 10% burn-in using Tree 
Annotator v.1.5.3 [27]. Phylogenetic trees were visualized 
and midpoint rooted in FigTree vl.2.2 (tree.bio.ed.ac.uk/ 
software/figtree/) . 

A phylogenetic tree constructed with the 1,306 bp se- 
quences partially comprising HBsAg and DNA polymerase 
coding regions (S/POL) (n = 198) is shown in Figure 1, in- 
cluding the 19 new sequences obtained by our group from 
Sao Paulo state. Bayesian analyses were performed to try to 
determine the grouping of the genotype F2a sequences 
available. The complete dataset included 71 sequences from 
Brazil: North Region (3 AP - Amapa State, 5 RO - 
Rondonia State, 26 AM - Amazonas State, including 5 from 
Manaus, the capital city, 5 from the Apurinas people and 
16 WAB from Western Amazon Basin (Apurinas and 
WAB samples were obtained from riverside communities 
living near Purus River); Northwest Region (5 PE - 



Pernambuco State); Southeast Region (11 RJ - Rio de 
Janeiro State and 19 SP - Sao Paulo State) and Center West 
Region (2 MT - Mato Grosso State) (Figure 2). 

The phylogenetic tree showed three groups containing 
sequences from the same state that grouped with a high a 
posteriori probability: a branch of sequences from the 
WAB, including Apurinas samples, all from the North Re- 
gion and two branches of sequences from the Southeastrn 
regions, one containing mostly sequences from Rio de 
Janeiro State and another with 10 sequences from Sao 
Paulo State. The other sequences did not show a pattern 
organized by Brazilian states or regions. 

Currently, subgenotype F2a has been mostly found in 
Venezuela and it has been considered as originating from 
the Amazon region, this subgenotype has been spreading 
through the different entries in different regions so diverse. 

We used all of genotype F sequences available in the 
GenBank and applied a coalescent-based approach. Ten 
HBV F2a sequences of patients from Sao Paulo were 
grouped in one cluster. It was not found specific relation 
or similar characteristics among these 10 patients when 
compared with the other nine sequences from Sao Paulo. 

Since most of the subgenotype F2a sequences are from 
Brazil and it is not reported from which state each se- 
quence was obtained, it was not possible to infer its routes 
of spread in this work. The spreading and the dynamic of 
subgenotype F2a in Brazil requires the study of a higher 
number of samples from different regions as it is unfold in 
almost all Brazilian populations studied so far. However, 
these results showing for the first time the distribution of 
F2a subgenotype in Brazil. 

In conclusion, we identified three groups of sequences of 
subgenotype F2a: 1) 10 sequences from Sao Paulo state; 2) 
3 sequences from f Rio de Janeiro and one from Sao Paulo 
states; 3) 8 sequences from the West Amazon Basin. We 
cannot infer with certainty the origin of these different 
groups due to the lack of available sequences. Nevertheless, 
our data suggest that the common origin of these groups 
probably occurred a long time ago. In this sense, more 
studies should be carried out with HBV infected popula- 
tions in our country, particularly with indigenous popula- 
tions in other regions of our country, particularly in the 
regions where the indigenous populations were common 
at the time of the beginning of Brazils colonization by 
European populations. 
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